Device and method for activating with voice input

ABSTRACT

An information processing apparatus that detects a voice command via a microphone in order to activate the device and execute certain applications. The apparatus comprises a digital signal processor (DSP) and a host controller which are responsible for processing the voice commands. The DSP recognizes and processes voice commands intermittently while the host processor is in a sleep state, thereby reducing the overall power consumption of the apparatus. Further, when the DSP is configured to recognize voice commands intended only to activate the device, a memory having a sufficiently lower storage capacity suffices.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No.13/780,010, filed Feb. 28, 2013, the entire contents of which areincorporated herein by reference.

BACKGROUND Field of the Disclosure

This disclosure relates to a method of efficiently operating a devicethrough voice instructions.

Description of the Related Art

Mobile devices, such as smart phones and tablets are examples of devicesthat are adapted to take as input a user's voice (in the form of voicecommands) and perform certain instructions.

Traditionally, when processing a voice command that is intended toexecute a certain application, the processing circuitry of the device(responsible for processing and analyzing the voice input), continuouslymonitors the input such as a microphone or the like, for voice commandsinputted from the user. In doing so, the control circuit of such adevice which executes an application essentially waits for theprocessing circuit to complete its tasks of analyzing the voice input.

In such scenarios, the electric power consumption of the deviceincreases dramatically. Accordingly, there is a requirement to lower thepower consumption in order to utilize the device in an efficient manner.

SUMMARY

Devices and methods for operating the devices via voice commands inorder to lower the power consumption of the device are discussed herein.

According to one exemplary embodiment, the disclosure is directed to aninformation processing apparatus comprising: a microphone configured todetect input voice instructions; a memory configured to store aplurality of keywords, each keyword being associated with apredetermined function of the information processing apparatus; firstcircuitry configured to compare an input voice instruction with theplurality of keywords stored in the memory; and activate secondcircuitry, which is configured to execute an application based on thecomparison.

According to another exemplary embodiment, the disclosure is directed toa method performed by an information processing apparatus, the methodcomprising: receiving input voice instructions from a microphone;storing a plurality of keywords in a memory, each keyword associatedwith a predetermined function of the information processing apparatus;comparing by a first circuitry the input voice instruction with theplurality of keywords; activating a second circuitry, by the firstcircuitry based on the comparing; and executing an application by thesecond circuitry corresponding to the keyword associated with theapplication.

According to another exemplary embodiment, the disclosure is directed toa non-transitory computer-readable medium including computer programinstructions, which when executed by an information processingapparatus, cause the information processing apparatus to perform aprocess comprising: receiving input voice instructions from amicrophone; storing a plurality of keywords in a memory, each keywordassociated with a predetermined function of the information processingapparatus; comparing by a first circuitry the input voice instructionwith the plurality of keywords; activating a second circuitry, by thefirst circuitry based on the comparing; and executing an application bythe second circuitry corresponding to the keyword associated with theapplication.

The foregoing general description of the illustrative implementationsand the following detailed description thereof are merely exemplaryaspects of the teachings of this disclosure, and are not restrictive.

BRIEF DESCRIPTION OF THE DRAWINGS

A more complete appreciation of the invention and many of the attendantadvantages thereof will be readily obtained as the same becomes betterunderstood by reference to the following detailed description whenconsidered in connection with the accompanying drawings, wherein:

FIG. 1 illustrates schematically an exemplary scenario outlining thesleep and active states of a digital signal processor (DSP) and a hostprocessor;

FIG. 2 illustrates schematically an exemplary mobile phone terminaldevice;

FIG. 3 illustrates schematically the structure of a DSP speechprocessing unit;

FIG. 4 illustrates an exemplary non-limiting example of the processingstates in the DSP;

FIG. 5 illustrates the processes conducted by a host processor;

FIG. 6 illustrates an exemplary flowchart depicting a method to wake upa device (i.e., brought to an active state) via a voice recognitionprocess according to one aspect of the present disclosure;

FIG. 7 illustrates the audio (speech) analysis performed by the DSP;

FIG. 8 illustrates an exemplary flowchart depicting a method to wake upa device via a voice recognition process according to a second aspect ofthe present disclosure;

FIG. 9 depicts a non-limiting example of an apparatus (a mobile phone)that transitions to an active state via voice commands according to thepresent disclosure;

FIG. 10 illustrates an exemplary flowchart depicting the steps takenwhile performing a voice pattern analysis;

FIGS. 11A and 11B illustrate an exemplary flowchart depicting a methodto wake up a device by a voice recognition process according to anotheraspect of the present disclosure; and

FIG. 12 illustrates an exemplary flowchart depicting a voice recognitionprocess implemented in the method of FIG. 8 to wake up a device.

FIGS. 13A-13C depict another non-limiting example illustrating theexecution of an application using the voice pattern analysis of FIG. 10.

DETAILED DESCRIPTION

Referring now to the drawings, wherein like reference numerals designateidentical or corresponding parts throughout the several views.

FIG. 1 illustrates a non-limiting exemplary scenario outlining the sleepand active states of a digital signal processor (DSP) and a hostprocessor that are comprised within a device which is configured to takeas input, voice commands (instructions) from a user in order to bring itto an active state. As shown in FIG. 1, the DSP 2, comprises a voicepattern memory 3, which stores user input data i.e., keywords which areused to activate the device. The keywords can be short words such as thename of user or simply commonly used words such as “hello”. Thesekeywords arc stored in the memory by a registration process which isexplained in detail later with reference to FIGS. 6 and 8.

The DSP 2, takes as input voice commands from a user which are inputtedvia a microphone 1. The analog data (user's voice) is modulated using apulse code modulation (PCM) technique or the like and fed in to the DSPfor further processing. The details of this voice processing andanalysis are explained in detail with reference to FIG. 3. Note that DSPis an active state while it receives information from the user. At thisinstant in time, the host processor is kept in a “sleep” state (i.e., asuspended state). The host processor is responsible for executingcertain applications based on the keyword input from a user. Low powerconsumption is obtained by maintaining the host processor in a sleepstate, while the DSP processes the voice commands input by a user.

On receiving the input from the microphone, the DSP compares the inputkeyword with those stored in the voice pattern memory 3. If a match isfound, the DSP notifies the host processor about a successful match viaan interrupt command (discussed later with reference to FIG. 8) andbrings the host processor to an active state. This process is referredto as “waking up” the host processor. The host processor on beingtransitioned to the active state executes a certain application (denotedby AP1-AP4) based on the recognized speech.

In one aspect of the present disclosure, a method of processing thespeech data, wherein the host processor processes the data and the DSPis responsible only for the analysis of the keyword is described. Insuch a setting, note that the keywords are used to wake up the device.The specific execution of an application proceeds later with voicecommands inputted by a user. In another aspect of the presentdisclosure, a method wherein the DSP is responsible for all the speechprocessing is described. Moreover, both embodiments outline as to howlower power consumption is obtained by specifically turning the DSP andthe host processor into sleep modes.

FIG. 2 illustrates a schematic block diagram of an exemplary mobilephone terminal device 100. As shown in FIG. 2, the mobile phone terminaldevice 100 may include an antenna 101 and a wireless communicationprocessing section 102. The wireless communication processing section102 may communicate wirelessly via radio signals, or the like, withother mobile devices via a base station. Further, a data signal, such asa voice transmission from another user, may be received by antenna 101and sent to the wireless communication processing section 102 forfurther processing. In the case of an incoming voice transmission, thevoice data signal may be sent from the wireless communication processingsection 102 to a voice processing section 103. Incoming voice datareceived by the voice processing section 103 via the wirelesscommunication processing section 102 may be output as sound via aspeaker 104.

Conversely, an outgoing voice signal may be supplied by a user to thevoice processing section 103 via a microphone 105. The voice signalreceived via microphone 105 and processed by the voice processingsection 103 may be sent to wireless communication processing section 102for transmission by the antenna 101. The voice processing section 103comprises a digital signal processor (DSP) 103 a which digitizes theincoming analog signal and processes the audio input to detect forkeywords. Keywords enable the operation of device 100, when it isconfigured to operate under the instructions of specific voice commands.These keywords are preset in the device with the aid of a voiceregistration unit and stored in the voice pattern library 103 b. Thedetailed description of this processes is explained later with referenceto FIGS. 5 and 6.

A second antenna 106 may be supplied for use with a short distancewireless communication processing section 107. The short distancewireless communication processing section 107 may communicate wirelesslywith other devices over a network, such as the Internet, a local areanetwork (LAN), or a wide area network (WAN). The second antenna 106 may,e.g., by a Wi-Fi transceiver.

A sensor section 108 may be provided for the mobile phone terminaldevice 100. The sensor section 108 may be a motion sensor that detects amotion of an object in the proximity of the mobile phone terminal device100. The motion may correspond to a user moving an instruction object,such as a finger or stylus, in the proximity of the mobile phoneterminal device 100 for the purpose of selecting data displayed ondisplay 120.

The mobile phone terminal device 100 may include display 120. Thedisplay 120 may be, for example a liquid crystal display (LCD) panel, anorganic electroluminescent (OLED) display panel, a plasma display panel,or the like. The display 120 may display text, an image, a web page, avideo, or the like. For example, when the mobile phone terminal device100 connects with the Internet, the display 120 may display text and/orimage data which is transmitted from a web server in Hyper Text MarkupLanguage (HTML) format and displayed via a web browser. The display 120may additionally display data stored in a memory 150.

A touch panel section 130 can detect a touch operation on the surface ofthe display 120. For example the touch panel 130 can detect a touchoperation performed by an instruction object such as a finger or stylus.Touch operations may correspond to user inputs such as a selection of anicon or a character string displayed on the display 120. The touch panelsection 130 may be an electrostatic capacitance type device, a resistivetype touch panel device, or other such type devices for detecting atouch on a display panel.

The touch panel section 130 may perform processing related to touchoperation classification. For example, the touch panel section 130 mayassign a predetermined function to be performed when a “tap” touchoperation is detected. Similarly, the touch panel section may analyze atouch operation in which the instruction object makes continuous contactwith the display 120 while moving the instruction object around thedisplay 120 (e.g., a “swipe” operation). The touch panel section 130 mayoutput a signal based on a classification of the touch operationperformed. The signal may for example include information indicating thetouch operation classification, the location on the display 120 wherethe touch operation was performed, and the operation to be performedbased on the touch operation.

Data which is detected and processed by the touch panel 130 can betransmitted to a host controller 110. The host controller/processor 110may include one or more processor units and can control each element ofthe mobile phone terminal device 100 based on data detected by the touchpanel 130, or by inputs received from operation key 140. The operationkey 140 may receive inputs, e.g., from external control buttons includedwith the mobile phone terminal device 100. The external control buttonsmay for example control the volume, the power, or a hold operation forthe mobile phone terminal device 100.

The host controller 110 may further execute instructions stored in thememory 150. The controller may further comprise of a DSP driver 111,which is configured to communicate with the DSP 103 a Specifically, thedriver may actuate the DSP during a voice registering phase, or the DSP103 a may initiate communication with the driver upon the successfuldetection of a voice command. The driver 111 may further activate thehost processor to execute a certain application based on the receivedvoice commands. The specific details pertaining to the driver 111 andthe host controller are explained later with reference to FIG. 5. Tothis end, the memory 150 may be a non-transitory computer readablemedium having instructions stored therein for controlling the mobilephone terminal device 100. Further, the controller 110 may include oneor more processors for executing the instructions stored on the memory150.

The mobile phone terminal device 100 can include a control line CL and adata line DL as internal bus lines for communication. The control lineCL can be used to transmit control data from the controller 110. Thedata line DL may be used for the transmission of voice data, displaydata, or the like, throughout the various elements of the mobile phoneterminal device 100.

FIG. 3 illustrates a block diagram depicting the processing of a voicecommand input by a user via the microphone 105. The output of themicrophone is first amplified before further processing by an amplifier103 c. After modulating the input voice (speech) by using a modulationscheme like the pulse code modulation or the like, the signal is inputto an analog to digital converter 103 d. This converter obtains digitalsamples of the input analog voice signal and stores them in a memorybuffer 103 e.

The DSP processor 103 a performs processing on these digitized voicesamples and checks for a match with certain user input keywords whichare stored in a voice pattern memory 103 b. Note that these keywords aresmall words/phrases such as “hello”, the user's name and are used towake up the device. Specific details pertaining to how the keywords arestored in the memory and the process of initiating an applicationexecution will be explained later with reference to FIGS. 6 and 8. Notethat the DSP may perform further processing on the voice samples such asnoise removal; voice compression etc, to further clean and/or makecompact the input voice samples for better processing.

The DSP communicates with the host processor via a control line (CL) anda data line (DL). Specific instructions and control signals areexchanged between the processors by using the CL. Data pertaining to theinputted voice command via the microphone is exchanged with the hostprocessor via the DL. This exchange of information between the DSP andthe host processor is explained in detail with reference to FIGS. 5 and7.

FIG. 4 depicts a non-limiting example of the processing states involvedin the DSP and illustrates the active and sleep cycles of the DSP. FIG.4A depicts a standard analog audio signal that is input to apparatus 100via the microphone 105. In FIG. 4B, a modulation scheme such as pulsecode modulation (PCM) is applied to the analog signal, which is furtherprocessed in an A/D converter to obtain digital samples. As statedpreviously, these digital signals are stored in the buffer memory 103 e.The digital data is read from the buffer memory and transferred to theDSP 103 for processing at every fixed time-period. Note, that as shownin FIG. 4C, the DSP is in an active state only when the data (at everyfixed time period) is read from the memory and transferred to theprocessor. At all other times the DSP remains in a sleep mode, thusenabling the decrease of power consumption.

While being in active state, the DSP reads the data from the buffermemory 103 e and compares (for a match) the input data to the set ofkeywords that are stored in the voice pattern memory 103 b. On finding asuccessful match, the DSP notifies the host processor of the matchedkeyword and brings it to an active state. Note that the host processoris in the sleep state while the DSP performs the comparison process.Further, it must be appreciated that the DSP is in an active state onlywhen data is read from the buffer 103 e for further processing. At allother times when data is not fed from the buffer, the DSP remains in asleep state. Thus it must be appreciated that by alternating from asleep to active state, the DSP operates in an intermittent manner thusproviding a valuable savings in terms of operating power. Further, notethat by keeping the host processor in a sleep state (while the DSPperforms it's processing), and activating it only when a keyword matchis successfully detected by the DSP, a further savings in the powerconsumption of the apparatus 100 is obtained.

FIG. 5 illustrates the processing steps performed by a host processorupon successful keyword match detected by the DSP. As shown in FIG. 5,the DSP 103 includes a voice analysis library 103 x and a voice patternmemory 103 b. The voice command received from the microphone 105 isinput to the voice analysis library wherein the input voice command iscompared with the keywords stored in the voice pattern memory 103 b fora match. Upon a successful match, the DSP instructs (via controlsignals) the DSP driver 111 comprised within the host processor totransition to an active state.

Once in active state, the DSP driver initiates the keyword recognitionprocess by instructing the keyword analysis application 113 to perform asearch for the appropriate application that is to be executed. Note thatthe instruction to perform the application search is sent from aframework unit 112, to the application analysis unit 113 via anapplication interface. On receiving instructions from the DSP driver111, the keyword analysis application searches for the applicationwithin the application set AP1-AP4, for the keyword associated with theapplication. On finding the correct match the appropriate application isexecuted.

Note that in the above process, the DSP 103 performs only a keywordmatch of the input voice command with the keywords registered by a userand stored in the voice pattern memory. Until a successful match isobtained the host processor remains in a sleep mode. On finding asuccessful match, the DSP activates the host controller (processor) viathe DSP driver. The host controller on receiving the keyword from theDSP, runs a search through the application database to search for theapplication associated with the keyword. Thus the process of applicationrecognition is performed by the host controller.

FIG. 6 is a flowchart of the process for waking up the device andoutlines the steps undertaken to recognize an input voice commandaccording to one aspect of the present disclosure. Note that hi thisexample the host processor 110 performs the voice registration processand the input processing of the audio signal. Further we assume for thesake of simplicity that the application intended to be executed by thevoice command is one of making a telephone call.

In step S11, a user inputs a voice command that is assigned to start anapplication. This command is registered in the voice patternregistration application. Note that commands such as “hello” or a user'sname can be assigned to indicate a start of the application execution.In step S12, the voice pattern registration application registers thestart of the specific application (with respect to the input keyword) inthe application launcher.

In step S13, the voice pattern registration application notifies the DSPdriver 111 about the assigned voice pattern, which in turn, in step S14registers the assigned voice pattern in the DSP 103. The DSP storesinformation pertaining to the assigned voice keyword in its voicepattern memory 103 b. This stored information is used later to obtain amatch, when a user inputs a voice command via the microphone 115.

In step S15, the host controller initiates a suspension (sleep) statefor all units other than the DSP 103 a. In step S16, the control unit ofthe host processor brings the DSP driver 111 (comprised within the hostprocessor) to a sleep state. Note (as shown in step S17) that the DSPdriver initiates a start of comparison of voice pattern to the DSP 103a. Once the DSP driver has initiated this start of comparison process tothe DSP, the host controller 101 is completely in a suspension state(step S18).

In step S19, the DSP 101 a takes as input, voice commands from themicrophone 115 and compares the inputted commands to the keywords storedin the voice pattern memory 103 b. Note that this process is performedintermittently as shown in FIG. 4. Specifically, the DSP undergoescycles of sleep and active states depending on the time it receivesinput from the buffer memory.

Step S20 depicts the case wherein the inputted voice command hasresulted in a match with one of the keywords stored in the voice patternmemory. Note that keywords intended to start the apparatus are smallwords/phrases such a “hello” or a users name. On detecting a match, instep S21 the DSP triggers the host processor via an interrupt signal. Indoing so, the host processor which was in a sleep/suspended state istransitioned to an active state. The host processor further triggers theDSP driver 111 to an active state (step S22). In step S23, the DSPdriver initiates the application launcher which in turn signals the DSPdriver to be in a wait state (step S24). Specifically, the applicationlauncher upon transitioning to an active state signals the DSP driver towait for further voice commands from the DSP. Note that at this point oftime no application is yet executed, rather, with the use of a keywordthe DSP and the host processor are activated to receive and processfurther instructions as to which specific application is to be executed.

Steps S25 to S35 illustrate the process from the time an input voiceinstruction is fed to the DSP 103 to the time the host processorexecutes the specific application. In step S25, the user inputs a voiceinstruction such as “make a telephone call” via the microphone 115. Thisinstruction is transmitted from the DSP to the DSP driver in step S26,which in turn invokes the application launcher and notifies it about thespecific voice instruction, as shown in Step S27.

In step S28, the application launcher performs an audio analysis asdescribed in FIG. 5. Based on the content of the data (voice) obtained,the audio analysis application 113 searches for the appropriateapplication to execute. In the example depicted in FIG. 6, we consideran application of making a telephone call. Thus, the applicationlauncher initiates the call application in step S29. In step S30, theapplication launcher notifies the DSP driver to wait for furtherinstructions from the user. For example, the application launcher mayrequest the DSP driver to receive the specific number to be called fromthe DSP.

On receiving this information in step S31, the DSP notifies the driverof the number to call (step S32) which in turn passes the information tothe application launcher (shown in step S33). Finally, the applicationlauncher notifies the call application of the number to call in stepS34, wherein-after the specific application (call application in thisexample) executes the associated task as shown in Step S35.

FIG. 7 illustrates according to another aspect of the presentdisclosure, a method of waking up the device and outlines the stepsundertaken by the host processor when the DSP conducts the processespertaining to voice analysis. Note that in the first aspect of thedisclosure (FIGS. 5 and 6), the DSP performed voice analysis only toactivate the host processor. This was accomplished with the use of akeyword such as “hello” which would initiate (i.e., bring to activestate) the host processor.

In the illustration of FIG. 7, the DSP's voice pattern memory memorizesnot only keywords associated to start the apparatus but also maintains alist of voice instructions required to execute an application. The voiceanalysis unit 103 x, receives input instructions from the microphone105, and compares the instructions with those stored in the voicepattern memory 103 b.

Upon finding a match, the DSP notifies the DSP driver 111 (of the hostprocessor) of the match, through an interrupt command. Note that theinterrupt command varies depending upon the application type to beexecuted. The driver transmits the information received from the DSP toa framework unit 112, which through an application interface (API)executes the appropriate application.

FIG. 8 represents a flowchart of the voice recognition process for theexample described in FIG. 7. For sake of illustration, we consider thesame application, that of making a phone call to explain the steps ofthe process of FIG. 8.

In step S41, a user inputs a voice command that is assigned to start anapplication. This command is registered in the voice patternregistration application. Note that commands such as “hello” or a user'sname can be assigned to indicate a wake up of the processors to furtherexecute a start of the application. In step S42, the voice patternregistration application notifies the DSP driver 111 of the patternregistered to activate the apparatus 100.

In step S43, the DSP driver notifies the DSP of the voice pattern. Note,that in this case, the memory 103 b associated with the DSP stores thevoice pattern for activating the host controller as well as theinstructions that initiates the execution of an application. In stepS44, the host controller initiates a suspension (sleep) state for allunits other than the DSP 103 a. In step S45, the control unit of thehost processor brings the DSP driver 111 (comprised within the hostprocessor) to a sleep state. In step S46, the DSP driver signals the DSP103 to start the process of comparing an input voice pattern with thosestored in the memory of the DSP. At this time instant, the DSP conductsa voice analysis by the method described in FIG. 4. Specifically, theDSP operates in an intermittent fashion wherein it undergoes a series ofperiodic sleep and active states. Note that while the DSP performsprocessing of the input voice instructions (step S48), the hostcontroller is in a suspension state as shown in step S47.

In step S49, once the DSP has confirmed a match between the voiceinstruction inputted from the microphone 105 and the voice patternstored in the memory 103 b, it sends an interrupt signal to thecontroller, thereby indicating the controller to resume an active state(step S50). The controller once activated, instructs the DSP driver instep S51, to transition the active state.

Once the DSP driver is in the active state it selects the application tostart and invokes the corresponding application as shown in Step S52.Note that in this case, we consider a simple application to make atelephone call. Accordingly the DSP driver, in step S53 initiates atelephone call operation. In step S54, the application (for example thecall application) instructs the DSP driver to wait for further inputfrom the DSP 103. This input pertains to the number required by the callapplication in order to make the telephone call. This number is inputtedas shown in step S55 and is notified to the DSP driver in step S56. Thenumber is further transmitted to the call application (step S57) whichfinally makes the telephone call in step S58. Note that in both aspectspresented thus far, the DSP and the host processor are put in a sleepmode when not processing information thereby providing a means to lowerthe power consumption of the apparatus.

FIG. 9 illustrates how an apparatus (e.g. a mobile phone terminal) canbe configured to be activated by voice instructions to perform a certainapplication. We consider an example of transferring data (images, textor the like) from the apparatus 100 to a display terminal 200. Note thatthe example depicts the two devices connected by a wired mechanism.However, the means of connecting this device is in no way limiting theexample. For example, the apparatus 100 and the display terminal 200 maywell be connected by a wireless mechanism to achieve the end objective.

Initially, both the apparatus 100 and the display terminal 200 are in asuspended state. On receiving instructions from a user the apparatus 100makes a transition to the active state. Specifically, the apparatus 100recognizes the voice pattern (keyword) inputted by a user, to bring itin an active state. On transitioning to the active state, the apparatus100 instructs the display to turn in to the active state.

Once the devices are activated, the display terminal 200 displays data121 of the mobile phone terminal on its display screen 201. Hence, inthis manner the display of information from a mobile phone to a displayterminal is accomplished by the use of voice instructions implemented ina manner presented in the present disclosure.

FIG. 10 illustrates an exemplary flowchart depicting the steps takenwhile performing a voice pattern analysis (i.e., voice recognitionprocess). The voice pattern analysis process includes two steps: a voiceverification step and a voice pattern matching step.

In the voice verification step, the voice of a registered (i.e.,authorized) user is first verified. Specifically, an input voice ismatched with a registered voice by verifying, in detail, thecharacteristics of the input voice waveform. Further, the keywordassociated with the voice is also verified. In doing so, a securitymechanism is created wherein unauthorized users are not granted accessto unauthorized devices. For example, if an authorized user hasregistered the word ‘hello’ to activate a mobile phone, only the keyword(‘hello’) spoken by the authorized user will activate the mobile phone.

Once the device is activated by the authorized user, the voice patternmatching step processes further voice instructions. The processing inthis step simply verifies if an appropriate keyword associated with anapplication is input. For example, if a voice instruction “maketelephone call”, is associated with the application of making atelephone call by the mobile phone, in the voice pattern matching stepthe input keyword is verified. Note that access in this step is grantedto all users. Specifically, once the device is activated by theauthorized user, any user can execute an application on the device, aslong as the correct instruction is input to the device.

FIG. 10 illustrates the steps performed by the voice pattern analysisprocess. First, the voice verification process is executed at specifictime instants that correspond to the instants of time the DSP of thedevice is active (as shown in FIG. 4). In step S91, a query is made tocheck if the voice verification is successful. If the response to thequery is affirmative, the process proceeds to step S92, wherein thevoice pattern matching process is performed. If the response to thequery in step S91 is negative (i.e., the voice verification iserroneous) the process simply loops back to step S91.

In step S92 a query is made to check if the voice pattern matching issuccessful. If the response to this query is negative, the process loopsback to step S92, wherein the device awaits the input of the appropriatevoice instruction. If the response in step S92 is successful, theprocess proceeds to step S93 wherein the application associated with theinstruction is executed.

In step S94, a query is made to check if a security time has elapsedfrom the onset of a successful voice verification process. If theresponse to the query is affirmative, the process loops back to stepS91, wherein the device needs to be activated again by the authorizeduser. However, if the response to the query is negative, the processloops back to step S92, wherein the DSP of the device awaits furtherinstructions from a user to execute an appropriate application.

FIGS. 11A and 11B illustrate an exemplary flowchart depicting a methodto wake up a device by a voice recognition process according to anotheraspect of the present disclosure. Note that the flowchart of FIG. 11Bdepict the steps performed after the steps of the flowchart depicted inFIG. 11A are executed. Further, note that the flowcharts of FIGS.11A-11B are similar to the flowchart of FIG. 6, and differ in the voicepattern analysis process. In what follows, we explain the steps of theflowcharts of FIGS. 11A-11B that differ from the steps of FIG. 6.

After the voice pattern registration application registers the start ofa specific application (with respect to an input keyword) in theapplication launcher (step S12), the voice pattern registrationapplication notifies the DSP driver, in step S13, the voice of theauthorized user.

In Step S14, the authorized voice is registered in DSP 103, by the DSPdriver. Note, that in doing so, an unauthorized user is not grantedpermission to activate the device. Further, the flowcharts of FIGS.11A-11C, differ in the manner in which the voice analysis is performed.As shown in step S21, a voice input via the microphone is first verifiedby processing the input voice waveform and matching the voice to that ofthe authorized user. Further, in step S22, the keyword input to activatethe device is analyzed.

Upon a successful match, the device is activated, thus enabling it toaccept further voice instructions (step S23) from any user to execute anapplication. For the sake of simplicity, we consider the sameapplication (as in FIG. 6) of making a telephone call. The further stepsthat execute the telephone call application are similar to those of FIG.6. Similarly, FIG. 12 depicts according to another embodiment of thepresent disclosure a flowchart depicting the steps taken to wake up adevice by using a voice recognition process. Specifically, theflowcharts of FIG. 12 is similar to the flowchart of FIG. 8, and differsin the voice pattern analysis process, which is similar to the voicepattern analysis process as described in FIGS. 11A and 11B.

FIGS. 13A-13C depict a non-limiting example illustrating the executionof an application using the voice pattern analysis process of FIG. 10.

Initially, as shown in FIG. 13A, the display terminal 200 is in aninactive state (OFF). On receiving instructions from an authorized user,the mobile terminal 100 that is connected to the display terminal 200,is brought to an active state. Note, that only the voice of theauthorized user can enable the mobile terminal to transition to anactive state (ON state). A predetermined instruction such as ‘Hello myphone’ can be used by the user to activate the mobile terminal 100, asshown in FIG. 13B.

Upon activating the mobile terminal, an application can be executed byany user. FIG. 13C depicts an example wherein a user intends to displaypictures from the mobile terminal 100 to the display terminal 200. Apre-determined instruction such as ‘Show pictures on TV’ can be used toindicate the execution of transferring pictures to the display terminal.Note that the processing involved in this stage, matches only theinstructions spoken by the user to the keyword associated with theapplication. Specifically, the voice waveform is not processed in detailto match a registered user's voice, thus enabling any user to execute anapplication, provided that the devices have been activated by aauthorized user.

Obviously, numerous modifications and variations of the presentinvention are possible in light of the above teachings It is thereforeto be understood that within the scope of the appended claims, theinvention may be practiced otherwise than as specifically describedherein. For example, the DSP as presented in FIG. 3 may be configured inanother manner, wherein the input data is modulated by mechanisms otherPCM. Further, the application considered in this disclosure was that ofmaking a telephone call. Similarly, other applications which can bevoice activated can be implemented in a manner as disclosed in FIGS. 6and 8. Additionally, devices other than the mobile phone terminal device100 as shown in FIG. 9 may be used to perform the features discussed inpresent disclosure. For example, aspects of the present disclosure maybe executed on a smart phone, a tablet, a general purpose computer, alaptop, an electronic reading device, or other such display terminals.

The above disclosure also encompasses the embodiments noted below.

(1) An information processing apparatus comprising: a microphoneconfigured to detect input voice instructions; a memory configured tostore a plurality of keywords, each keyword being associated with apredetermined function of the information processing apparatus; firstcircuitry configured to compare an input voice instruction with theplurality of keywords stored in the memory; and activate secondcircuitry, which is configured to execute an application based on thecomparison.

(2) The information processing apparatus of (1), wherein the secondcircuitry is in a suspended state when the first circuitry performs thecomparison.

(3) The information processing apparatus of (1), wherein the firstcircuitry is continuously in an active state when the second circuitryis in an active state.

(4) The information processing apparatus of (1), wherein the firstcircuitry activates the second circuitry based on a successful match ofthe input voice instruction with a keyword.

(5) The information processing apparatus of (1), wherein the voiceinstructions are modulated by pulse code modulation before being storedin the memory.

(6) The information processing apparatus of (5), wherein the modulatedvoice instructions are input to the memory at a fixed time period.

(7) The information processing apparatus of (6), wherein the firstcircuitry is in an active state for a fraction of the fixed time period.

(8) The information processing apparatus of (1), wherein the input voiceinstructions include the keyword and further instructions to execute theapplication.

(9) The information processing apparatus of (1), wherein the firstcircuitry activates the second circuitry by an interrupt signal.

(10) The information processing apparatus of (1), wherein the firstcircuitry and the second circuitry exchange data through a plurality ofdata lines.

(11) The information processing apparatus of (1), wherein the firstcircuitry and the second circuitry exchange control information througha plurality of control lines.

(12) The information processing apparatus of (1), wherein the secondcircuitry receives information from the first circuitry regarding asuccessful match of the voice instruction and the keyword and searchesfor the corresponding application associated with the keyword.

(13) The information processing apparatus of (12), wherein the secondcircuitry executes the application upon a successful keyword search.

(14) The information processing apparatus of (1), wherein the firstcircuitry includes a driving unit configured to activate the secondcircuitry.

(15) The information processing apparatus of (1), wherein the firstcircuitry is configured to match an input voice with a registered voiceof an authorized user.

(16) The information processing apparatus of (15), wherein the firstcircuitry is further configured to match an input keyword to apredetermined keyword.

(17) The information processing apparatus of (16), wherein the secondcircuitry is configured to execute the application by any user.

(18) An information processing method performed by an informationprocessing apparatus, the method comprising: receiving input voiceinstructions from a microphone; storing a plurality of keywords in amemory, each keyword associated with a predetermined function of theinformation processing apparatus; comparing by a first circuitry theinput voice instruction with the plurality of keywords; activating asecond circuitry, by the first circuitry based on the comparing; andexecuting an application by the second circuitry corresponding to thekeyword associated with the application.

(19) A non-transitory computer-readable medium including computerprogram instructions, which when executed by an information processingapparatus, cause the information processing apparatus to perform aprocess, the process comprising: receiving input voice instructions froma microphone; storing a plurality of keywords in a memory, each keywordassociated with a predetermined function of the information processingapparatus; comparing by a first circuitry the input voice instructionwith the plurality of keywords; activating a second circuitry, by thefirst circuitry based on the comparing; and executing an application bythe second circuitry corresponding to the keyword associated with theapplication.

1. An information processing apparatus comprising: a microphoneconfigured to detect input voice instructions; a memory configured tostore a plurality of keywords, each keyword being associated with apredetermined function of the information processing apparatus; firstcircuitry configured to compare an input voice instruction with theplurality of keywords stored in the memory; and activate secondcircuitry, which is configured to execute an application based on thecomparison.
 2. The information processing apparatus of claim 1, whereinthe second circuitry is in a suspended state when the first circuitryperforms the comparison.
 3. The information processing apparatus ofclaim 1, wherein the first circuitry is continuously in an active statewhen the second circuitry is in an active state.
 4. The informationprocessing apparatus of claim 1, wherein the first circuitry activatesthe second circuitry based on a successful match of the input voiceinstruction with a keyword.
 5. The information processing apparatus ofclaim 1, wherein the voice instructions are modulated by pulse codemodulation before being stored in the memory.
 6. The informationprocessing apparatus of claim 5, wherein the modulated voiceinstructions are input to the memory at a fixed time period.
 7. Theinformation processing apparatus of claim 6, wherein the first circuitryis in an active state for a fraction of the fixed time period.
 8. Theinformation processing apparatus of claim 1, wherein the input voiceinstructions include the keyword and further instructions to execute theapplication.
 9. The information processing apparatus of claim 1, whereinthe first circuitry activates the second circuitry by an interruptsignal.
 10. The information processing apparatus of claim 1, wherein thefirst circuitry and the second circuitry exchange data through aplurality of data lines.
 11. The information processing apparatus ofclaim 1, wherein the first circuitry and the second circuitry exchangecontrol information through a plurality of control lines.
 12. Theinformation processing apparatus of claim 1, wherein the secondcircuitry receives information from the first circuitry regarding asuccessful match of the voice instruction and the keyword and searchesfor the corresponding application associated with the keyword.
 13. Theinformation processing apparatus of claim 12, wherein the secondcircuitry executes the application upon a successful keyword search. 14.The information processing apparatus of claim 1, wherein the firstcircuitry includes a driving unit configured to activate the secondcircuitry.
 15. The information processing apparatus of claim 1, whereinthe first circuitry is configured to match an input voice with aregistered voice of an authorized user.
 16. The information processingapparatus of claim 15, wherein the first circuitry is further configuredto match an input keyword to a predetermined keyword.
 17. Theinformation processing apparatus of claim 16, wherein the secondcircuitry is configured to execute the application by any user.
 18. Aninformation processing method performed by an information processingapparatus, the method comprising: receiving input voice instructionsfrom a microphone; storing a plurality of keywords in a memory, eachkeyword associated with a predetermined function of the informationprocessing apparatus; comparing by a first circuitry the input voiceinstruction with the plurality of keywords; activating a secondcircuitry, by the first circuitry based on the comparing; and executingan application by the second circuitry corresponding to the keywordassociated with the application.
 19. A non-transitory computer-readablemedium including computer program instructions, which when executed byan information processing apparatus, cause the information processingapparatus to perform a process, the process comprising: receiving inputvoice instructions from a microphone; storing a plurality of keywords ina memory, each keyword associated with a predetermined function of theinformation processing apparatus; comparing by a first circuitry theinput voice instruction with the plurality of keywords; activating asecond circuitry, by the first circuitry based on the comparing; andexecuting an application by the second circuitry corresponding to thekeyword associated with the application.